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Aims To validate the European Heart Rhythm Association (EHRA) symptom classification in atrial fibrillation (AF) and test 

whether its discriminative ability could be improved by a simple modification. 

Methods We compared the EHRA classification with three quality of life (QoL) measures: the AF-specific Atrial Fibrillation Effect 

and results on QualiTy-of-life (AFEQT) questionnaire; two components of the EQ-5D instrument, a health-related utility which can 

be used to calculate cost-effectiveness, and the visual analogue scale (VAS) which demonstrates patients' own assessment 
of health status. We then proposed a simple modification [modified EHRA (mEHRA)] to improve discrimination at the 
point where major treatment decisions are made, quality of life data and clinician-allocated EHRA class were prospect- 
ively collected on 362 patients with AF. A step-wise, negative association was seen between the EHRA class and both the 
AFEQT and the VAS scores. Health-related utility was only significantly different between Classes 2 and 3 (P < 0.001). 
We developed and validated the mEHRA score separating Class 2 (symptomatic AF not limiting daily activities), based on 
whether the patients were 'troubled by their AF' (Class 2b) or not (Class 2a). This produced two distinct groups with 
lower AFEQT and VAS scores and, importantly, both clinically and statistically significant lower health utility (Autility 
0.9, P = 0.01) in Class 2b than Class 2a. 

Conclusion Based on patients' own assessment of their health status and the disease-specific AFEQT, the EHRA score can be considered 

a useful semi-quantitative classification. The mEHRA score has a clearer separation in health utility to assess the cost efficacy 
of interventions such as ablation, where Class 2b symptoms appear to be the appropriate treatment threshold. 

Keywords Atrial fibrillation • Symptoms • Quality of life • Symptom score • EHRA 



Introduction 

Atrial fibrillation (AF) is an emerging medical epidemic affecting 1 — 
2% of the general population and up to 15% of elderly patients. 1-4 



As well as doubling mortality rate, there is a significant effect on mor- 
bidity due to symptoms such as palpitation, fatigue, dyspnoea, and ex- 
ercise intolerance. 5,6 The resultant impact on quality of life (QoL) is as 
dramatic as for patients with severe structural heart disease. 7 Up to 
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What's new? 

• The EHRA score has been validated as a semi-quantitative 
measure of AF related symptoms and patients' perception of 
their general state of health. 

• Asimple modification ofthe EHRA score (the mEHRA score) 
provides better discrimination for patients with mild to mod- 
erate symptoms. 

• Patients with mild symptoms, which the patient does not find 
troublesome, have a Health-related Quality of Life compar- 
able to asymptomatic patients suggesting they may not be ap- 
propriate candidates for intervention. 



one-third of patients present with limiting symptoms, and inpatient 
care is a major consumer of healthcare resources. 8 

In 2007, an expert group ofthe German Atrial Fibrillation Compe- 
tence NETwork (AFNET) and the European Heart Rhythm Associ- 
ation (EHRA) published recommendations for the conduct of 
clinical trials in AF. 9 Noting that no accepted and easily applicable 
measure for the AF-related symptoms exists, this group of experts 
proposed and described in their recommendations a new scoring 
system, the EHRA classification, to assess and quantify symptoms 
related to AF. 9 The EHRA classification is based on the impact of 
symptoms on daily activity during presumed episodes of AF. It is 
simple to use and has a format similar to the New York Heart Asso- 
ciation (NYHA) symptom classification system for patients with 
heart failure, making it relatively intuitive for practicing clinicians 
while hopefully being more appropriate to AF-related symptoms. 10 
When the EHRA score was published, the authors made a specific 
mention of the need for validation. 9 A similar score was thereafter 
validated by a Canadian group. 6 Since its initial proposal, the EHRA 
score has entered widespread use and has even been used in the 
European Society of Cardiology guidelines for the management of 
AF in the recommendations for rate and rhythm control. 11 ~ 16 

To put the EHRA score into context with existing tools to assess 
disease-related QoL in AF patients, we compared the EHRA score 
with accepted and validated measures of health-related QoL. Specif- 
ically, we used both a disease specific tool, the Atrial Fibrillation Effect 
on QualiTy-of-life (AFEQT) questionnaire, and a general tool, the 
very well-established EQ-5D questionnaire. 17-19 The AFEQT is 
very sensitive to changes in symptom burden but cannot be com- 
pared with other conditions. EQ-5D, however, is applicable to a 
wide range of health conditions and provides a single index value 
for health status, called the health utility, which can be used to calcu- 
late 'quality-adjusted life years' (QALYs) for health economic evalu- 
ation. 20 Previous studies have suggested that the minimal meaningful 
difference in the EQ-5D-derived health utility is 0.07. 20 In line with 
this, a previous study of the cost-effectiveness of catheter ablation 
in the UK setting found catheter ablation to be cost-effective with 
an estimated utility difference of 0.09 between symptomatic AF on 
drugs compared with sinus rhythm following ablation. 21 

We aimed to validate the EHRA score by using these general and 
disease-specific QoL measures. We hypothesized that the discrim- 
inative power of the EHRA score could be improved and attempted 
to achieve this through a simple modification. 



Table I Modified EHRA (mEHRA) classification 



mEHRA 

score 


Symptoms 


Description 


1 


None 




2a 


Mild 


Normal daily activity not affected, 






symptoms not troublesome to 






patient 


2b 


Moderate 


Normal daily activity not affected 






but patient troubled by symptoms 


3 


Severe 


Normal daily activity affected 


4 


Disabling 


Normal daily activity discontinued 



Underlined text represents the modification to the original descriptions of EHRA 
classes. 



Methods 

Phase 1 

Consecutive patients with a diagnosis of AF attending designated heart 
rhythm/electrophysiology clinics at a single specialist cardiac hospital in 
England (Liverpool Heart and Chest Hospital) were invited to complete 
the AFEQT questionnaire and the EQ-5D instrument, including the 
visual analogue scale (VAS). 18,19 At the same clinical visit, the reviewing 
clinician was asked to independently score the patient according to the 
EHRA symptom classification. In keeping with real-world practice, 
these assessments were completed by a range of clinicians experienced 
in the management of patients with AF, including consultant cardiologists, 
trainee physicians, and arrhythmia nurses. Clinicians were provided with, 
and requested to complete, a classification form that listed the published 
definitions for each class. To replicate how the classification is likely to be 
used in routine practice, no specific training was provided beyond that 
described above and access to the original publication in which the 
EHRA score was proposed. This mirrored the methods used to validate 
an alternative classification system. 6 

Quality of life was assessed by the AFEQT score (global), the 
health-related utility based on the EQ-5D instrument, and the VAS. For 
each measure, a higher score represents a higher QoL. AFEQT and VAS 
are scored from 0 to 100. Health utility (EQ-5D) ranges between 1 
(perfect health) through 0 (death) to —0.59 (worse than death). Mean 
QoL scores were compared between neighbouring EHRA classes to 
assess the classification system's accuracy in semi-quantifying QoL. 

Phase 2 

We proposed a modified EHRA (mEHRA) classification by subdividing 
Class 2 into Classes 2a (mild) and 2b (moderate) according to the 
degree to which the patient was 'troubled by their symptoms' 
(Table 1). All the patients categorized as Class 2 during Phase 1 of the 
study were independently re-categorized as either Class 2a or 2b by 
two clinicians (electrophysiology fellows) who were both blinded to 
the corresponding QoL scores. Clinical letters were reviewed with spe- 
cific attention made to the extent to which the patients appeared 
'troubled by their symptoms', given that their daily activities were not 
affected (which would indicate Class 3 symptoms). We sought to 
assess the patient's own perception ofthe impact of AF on their well- 
being. Those suffering from anxiety, loss of confidence, or symptoms 
that they found unpleasant were graded as Class 2b. There was agree- 
ment between the reviewers for all the cases reclassified into mEHRA 
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Classes 2a and 2b. Where the clinical letters were lacking in detail, the 
hospital records were re-reviewed and a consensus agreed upon. The 
mEHRA score was assessed and validated using the same methods 
used to validate the EHRA score in the initial phase of the study. 

Phase 3 

Having completed Phases 1 and 2, we wished to verify and validate the 
findings from our retrospective scoring of the mEHRA score, by compar- 
ing ourfindings with an independent cohort in whom the mEHRA score 
had been used prospectively. The method of data collection forthis phase 
matched exactly with that in Phase 1,exceptthatthe clinicians were asked 
to classify symptoms according to the new, expanded mEHRA score 
rather than the original EHRA score. We compared the prospective 
and the retrospective scores, for the two new mEHRA classes, for 
each of the three QoL measures used in the previous two phases. 

Atrial Fibrillation Effect on QualiTy-of-Life 
questionnaire 

The AFEQT questionnaire is a validated, disease-specific, self-administered 
QoL instrument. 17 It has 20 questions with four conceptual domains: 
symptoms (four questions specifically targeted to assess AF-related 
symptoms), Treatment Concerns (six questions that assess AF treatment 
concerns in patients), Daily Activities (eight questions that evaluate daily 
function in AF patients), and Treatment Satisfaction (two questions asking 
about how well the current treatment controls their AF and relieves 
symptoms). Each of the 20 questions is marked on a 7-point Likert 
scale. A published algorithm exists to allow the calculation of a score, 
between 0 and 100 (where higher is better), for each domain and a 
global score based on the first three domains. The questionnaire specifies 
a recall period of the preceding 4 weeks. 

EQ-5D questionnaire (Three-level version) 

EQ-5D is a standardized measure of health status developed by the 
EuroQol Group to provide a simple, generic measure of health for clinical 
and economic appraisal. 18 Two hundred and forty-three possible health 
states can be defined from the five questions and these can be converted 
into a single summary index called the 'health utility' based on country- 



specific value sets. The health utility measure is of particular interest 
as it is generalizable to other diseases and to the general population. 
Therefore, it allows the calculation of QALYs and thus the cost- 
effectiveness of interventions such as catheter ablation. 

Visual analogue scale 

The VAS forms an integral but distinct part of the EQ-5D instrument. 
Respondents are asked to mark a single point on a linear scale that repre- 
sents their health status on the day of completion. The scale extends from 
0 (worst imaginable health state) to 100 (best imaginable health state). 
The VAS provides a quantitative measure of state of health, as judged 
by the individual patients. 

Statistical analysis 

Analyses were performed using StatsDirect software version 2.7.8. Ana- 
lysis of QoL scores (AFEQT score, EQ-5D-derived utility, and VAS) using 
the Shapiro- Wilk Wtest showed non-Gaussian distribution that was not 
corrected by logarithmic transformation. However, the central limit 
theorem is generally taken to imply that an assumption of normality is 
not necessary for parametric testing to be valid if the group sizes are 
greater than about 30. This is the case for all but one group in our analysis. 
In addition, presentation of the data as means with standard deviation was 
adjudged to be of greater clinical relevance than presentation of medians 
with the inter-quartile ranges. Continuous data are therefore presented 
as mean + standard deviation and compared using the t-test. A similar 
approach has been used in previous work using QoL measures. 23 A two- 
tailed test was used where there was no a priori expectation of the direc- 
tion of the difference between the groups. However, where a compari- 
son was made between two adjacent groups, resulting in assessment 
for a difference in a single direction only, one-tailed testing was consid- 
ered more appropriate. Analysis of variance was compared by using 
one-way analysis of variance (ANOVA). Trend across groups was 
assessed by using Cuzick's test. Proportions were compared by using 
the \ 2 statistic. Intra- and inter-observer variability were calculated, 
and assessed by using the k statistic. A P value of 0.05 or lower was con- 
sidered significant for all tests. 



Table 2 Baseline characteristics by EHRA class 





EHRA 1 


EHRA 2 


EHRA 3 


EHRA 4 


Total 


P value for trend 


N, number 


149 


99 


90 


24 


362 


n/a 


Age (mean + SD) 


61.1 (±11.4) 


59.7 ( + 12.2) 


57.9 ( + 13.0) 


62.2 ( + 9.9) 


60.0 ( + 12.0) 


NC 


Male gender 


70.3% 


58.6% 


73.0% 


62.5% 


66.9% 


NC 


% PAF a 


49.0% 


65.6% 


62.2% 


41.7% 


56.4% 


0.36 


Diabetes 


6.8% 


7.1% 


8.8% 


20.0% 


8.3% 


NC 


Heart failure 


5.4% 


3.0% 


7.7% 


12.0% 


5.8% 


NC 


Hypertension 


54.4% 


36.4% 


46.2% 


72.0% 


48.6% 


0.003 


Pacemaker 


5.4% 


7.1% 


9.9% 


12.0% 


7.5% 


NC 


COPD 


4.8% 


6.1% 


8.8% 


4.0% 


6.1% 


NC 


Previous stroke/TIA 


8.2% 


6.1% 


13.2% 


12.0% 


9.1% 


NC 


Prior ablation 


77.6% 


52.5% 


25.3% 


24.0% 


53.9% 


<0.0001 


Subsequent ablation 


6.8% 


34.3% 


61.5% 


76.0% 


32.9% 


<0.0001 



NC, not calculated (where ANOVA revealed no significant variance between groups, a test for trend was not performed). 
Predominant pattern at time of assessment. 

COPD, chronic obstructive pulmonary disease; TIA, transient ischaemic attack; PAF, paroxysmal atrial fibrillation. The term 'ablation' is used to denote catheter ablation of atrial 
fibrillation (e.g. pulmonary vein isolation). 
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The study was performed as part of a wider institutionally approved 
patient reported outcome measures service improvement programme 
at the recruiting centre. 

Results 

Quality of life and symptom data were collected on 362 patients 
attending the heart rhythm clinics during 201 2. All patients received 
physician-allocated EHRA classification at the same clinical visit. The 
baseline characteristics are given in Table 2. There were no clear dif- 
ferences between the groups in terms of age, sex, or proportion clas- 
sified as having paroxysmal, as opposed to persistent or permanent, 
AF. Previous studies have found QoL to be lower in females. 24 
However, in our cohort, patient gender did not have a significant 
effect on any of the three QoL measures (EQ-5D P = 0.56, VAS 
P = 0.70, and AFEQT P = 0.14). Hypertension was by far the most 
common comorbid condition (48.6% of all patients) and showed a 



significant trend towards increasing prevalence in higher EHRA 
classes (P = 0.003). Other comorbidities were infrequently 
present and were similar across classes. However, there was a 
clear trend seen whereby those in less severe symptom classes 
were the most likely to have previously undergone ablation and 
those in the most severe classes were considerably more likely to 
subsequently go on to have an ablation for atrial fibrillation. (P < 
0.0001 for both). A small proportion of asymptomatic patients 
went on to have an ablation in keeping with the published data 
from the EUR Observation study where 1 3% of patients were asymp- 
tomatic, citing a desire for a drug free lifestyle, improved QoL, and/or 
the maintenance of sinus rhythm. 25 

Phase 1 

Results forthe three QoL measures are shown in Tableland figure J. 
Analysis of variance and regression analysis confirmed significant 



Table 3 Mean (and standard deviation) shown for each EHRA class 



EHRA class 


Utility (by EQ-5D) 


P value 


VAS 


P value 


AFEQT 


P value 


1 


0.85 ( + 0.21) 


n/a 


76.2 ( + 19.9) 


n/a 


78.4 ( + 19.0) 


n/a 


2 


0.81 ( + 0.17) 


0.08 


70.3 (±20.3) 


0.02 


63.6 ( + 20.0) 


<0.0001 


3 


0.69 ( + 0.27) 


<0.001 


59.6 ( + 21.9) 


< 0.001 


42.1 ( + 21.1) 


<0.0001 


4 


0.59 ( + 0.29) 


0.08 


46.9 ( + 25.9) 


0.03 


31.3 (+18.6) 


0.01 



P values compare each class with the next lowest class in terms of symptom severity. 
VAS, visual analogue scale; AFEQT, AFEQT global score. 



Quality of life scoring by EHRA classification 




Mean + standard deviation 




1471 




EQ-5D score 



EQ VAS 



AFEQT global 



100 

90 

80 

70 

60 

50 

40 

30 

20 

10 

0 



EHRA 1 



EHRA 2 ■ EHRA 3 ■ EHRA 4 



Figure I Quality of life (QoL) scores (mean + standard deviation) by EHRA class. 
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negative correlation between the EHRA class and QoLas assessed by 
all three measures. To determine the ability of the EHRA classifica- 
tion as a semi-quantitative tool, each EHRA class was compared 
with the class immediately below in terms of QoL (i.e. EHRA 2 
with EHRA 1, EHRA 3 with EHRA 2, and EHRA 4 with EHRA 3). 
On using the disease-specific AFEQT score, significant differences 
were seen at each grade boundary suggesting that the EHRA score 
was an effective means of categorizing patients' symptoms. Similarly, 
on using the patient-based VAS, there was a significant difference 
between each and its immediate neighbour suggesting that the 
EHRA score effectively categorized patients in terms of their own as- 
sessment of their health state. However, when comparing the 
health-related utility, derived from the EQ-5D questionnaire, al- 
though there was a significant difference of 0.12 (P < 0.001) 



between Classes 2 and 3, the difference between Classes 1 and 2 
was only 0.04 (P = 0.08). This observation prompted us to develop 
the mEHRA classification. 

Phase 2 

Of the 99 patients originally classified as EHRA Class 2, 90 had suffi- 
cient detail in their archived clinical letters to be reclassified in ac- 
cordance with the proposed mEHRA classification into either 
Class 2a or 2b. Of these, 43 were classified as Class 2a and 47 were 
classified as Class 2b. The subdivision of Class 2 into Classes 2a and 
2b, resulted in clearly separate groups, with the Class 2a patients 
having AFEQT scores and health utilities much closer to the Class 
1 patients and the Class 2b patients having significantly more symp- 
toms as judged by the AFEQT and a significant reduction in the 



Table 4 Mean (and standard deviation) shown for each mEHRA class 





mEHRA class 


Utility (by EQ-5D) 


P value 


VAS 


P value 


AFEQT 


P value 




1 


0.85 ( + 0.21) 


n/a 


76.2 ( + 19.9) 


n/a 


78.4 (±19.0) 


n/a 




2a 


0.86 ( + 0.18) 


0.41 


75.6 ( + 19.9) 


0.43 


70.9 (±19.8) 


0.01 




2b 


0.77 ( + 0.15) 


0.01 


65.2 ( + 20.1) 


0.01 


58.3 (±17.3) 


<0.001 




3 


0.69 ( + 0.27) 


0.02 


59.6 ( + 21.9) 


0.09 


42.1 (±21.1) 


<0.0001 




4 


0.59 ( + 0.29) 


0.08 


46.9 ( + 25.9) 


0.03 


31.3 (±18.6) 


0.01 



P values compare each class with the next lowest class in terms of symptom seventy. 
VAS, visual analogue scale; AFEQT, AFEQT global score. 




Figure2 Quality of life (QoL) scores (mean ± standard deviation) by mEHRA class. Classes 1, 3, and 4 are as for figure 1. Class 2 has been split into 
Classes 2a (n = 43) and 2b (n = 47). 
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90 



80 



70 



AFEQT domains by mEHRA classification 



Mean + standard deviation 




731 



661 



it 



1 44 1 



36 



a 



1 40 1 



27 



1 44 1 



34 



Symptoms 



Activities 



Concerns 



mEHRA 1 ■ mEHRA 2a ■ mEHRA 2b ■ mEHRA 3 □ mEHRA 4 □ 



Figure 3 AFEQT sub-domain scores (mean + standard deviation) by mEHRA class. 



health utility as judged by the EQ-5D.The results are shown in Table 4 
and figure 2. 

We also analysed the mEHRA class accordingto the three subdo- 
mains of the AFEQT score: Symptoms, Activities, and Concerns. As 
can be seen in figure 3, there is a clear step-wise trend to lower scores 
as the mEHRA class increases. As these components of the AFEQT 
global score are not themselves individually validated we did not 
seek to analyse these on a class-by-class basis. However, the 
ANOVA confirmed a highly significant difference between the 
groups for all three subdomains (P < 0.0001 for each) and Cuzick's 
trend test showed a strong, and highly significant, trend towards 
lower scores with increasing mEHRA class (P < 0.0001 for each of 
the three subdomains). 

Reproducibility of the modified European 
Heart Rhythm Association class 

To assess the reproducibility of the mEHRA score we measured both 
intra-observer and inter-observer variability. Agreement between 
the two assessors for the ratings of Class 2a or 2b was very good 
at 83.2% (k 0.70, 95% confidence interval (CI) = 0.53-0.87). Inter- 
observer variability was assessed by asking each assessor to re- 
classify a random sample of 20 of the original clinical letters used 
for Phase 2 of the study. This was performed after an interval of 
several months to avoid bias due to recall of the previous classifica- 
tion. This demonstrated excellent repeatability, with an agreement 
between the original and the repeat classification of 90% for one as- 
sessor and 95% for the other (combined k 0.85, 95% CI for k, 0.54- 
1.16). 



Phase 3: prospective validation of the 
modified European Heart Rhythm 
Association 

The m EH PA score was thereafter prospectively applied to a second 
cohort of patients attending the Heart Rhythm Clinics at Liverpool 
Heart and Chest Hospital. By using the modified scoring system, 
165 patients were classified as either Class 2a (n = 85) or 2b (n = 
80). These data are shown in Table 5. Prospective scoring showed 
the same pattern as retrospective scoring with significantly lower 
AFEQT, EQ-5D, and VAS scores for mEHRA Class 2b than Class 
2a. There were no significant differences between any of the three 
QoL measures between this validation cohort and the initial 
cohort in either of the two new mEHRA classes (2a and 2b). 

Discussion 

This comparison of the EHRA symptoms classification with one 
disease-specific QoL instrument (AFEQT) and another general 
measure for the health-related QoL (EQ-5D, incorporating the 
VAS) provides good evidence that the EHRA score can be used to 
assess AF-related symptoms without prior training. It is easily 
applied in clinical practice and has the potential to be useful in clinical 
trials. However, it does not discriminate sufficiently in patients with 
low-level symptoms in terms of the health-related utility. We 
found that the EHRA classification can be improved by subdividing 
Class 2 into two separate classes (2a and 2b). 

Phase 1 of this study provides evidence that the EHRA classifica- 
tion, in its originally proposed format correlates well with the 
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Table 5 Comparison of retrospective and prospective QoL scores, for each of the two proposed additional mEHRA classes 



QoL measure 




Retrospective 
n = 90 


Prospective 
n = 165 


P value 

Retrospective vs. prospective 


Utility (EQ-5D) 


2a 


0.86 ( + 0.18) 


0.81 ( + 0.22) 


0.23 




2b 


0.77 ( + 0.1 5) 


0.72 ( + 0.22) 


0.25 


P value 2a vs. 2b 




P = 0.01 


P < 0.0001 




VAS 


2a 


75.6 ( + 19.9) 


77.9 ( + 15.9) 


0.51 




2b 


65.2 ( + 20.1) 


67.0 ( + 16.4) 


0.60 


P value 2a vs. 2b 




P = 0.01 


P= 0.009 




AFEQT 


2a 


70.9 ( + 19.8) 


67.7 ( + 22.2) 


0.42 




2b 


58.3 ( + 17.3) 


54.1 ( + 20.2) 


0.25 


P value 2a vs. 2b 




P < 0.001 


P < 0.0001 





disease-related QoL, as judged by the AFEQT questionnaire and with 
the patients' own perception of their health state (VAS). There is a 
step-wise, negative association between the EHRA class and both 
of these measures. The EHRA score can therefore be considered 
as a validated tool for symptom classification. In a similar manner to 
the NYHA functional class for heart failure and the Canadian Cardio- 
vascular Society angina scale, the EHRA score allows clinicians to 
broadly categorize the severity of patients' symptoms. Where the 
specifics and the complexities of symptoms can be considerable, 
this sort of categorization provides a simple means of communicating 
and quantifying symptom severity. It allows a cross-sectional com- 
parison between patients and a longitudinal comparison for individual 
patients or groups of patients. 

However, using health-related utility as a measure of QoL, the 
EHRA score only showed significant discriminatory power at the 
boundary between mild (Class 2) and severe (Class 3) symptoms. 
To try to improve the discriminative ability for patients with mild- 
moderate symptoms, we subdivided the EHRA Class 2 patients 
into Classes 2a and 2b, based on the degree to which the patients 
were 'troubled by their symptoms', and found that the two subdivi- 
sions were significantly different from each other on all three QoL 
measures. Indeed, the health-related utility showed no significant dif- 
ference between Class 1 (asymptomatic patients) and Class 2a 
(patients with symptoms but which are not troublesome and do 
not affect daily activity). Class 2b patients showed a significant reduc- 
tion (0.09) in health utility compared with Class 2a patients (Table 4). 
As such it may be more appropriate to base the treatment decisions 
not simply on the presence or the absence of symptoms, but also on 
whether the symptoms cause trouble to the patient or not. 

At first sight, the EQ-5D questionnaire does not intuitively repre- 
sent a sensitive QoL assessment for a patient with AF. The value in its 
use, however, is the assessment of a health utility score at baseline, 
which can then be reassessed following treatment. This allows calcu- 
lation of the cost-efficacy of the intervention, and thereby a compari- 
son of medical interventions between different disciplines. There is a 
growing demand in all countries to understand the cost-effectiveness 
of treatments used for common conditions to ensure the efficient 
and appropriate use of health resources. This is particularly the 
case when considering potentially costly treatments such as catheter 
ablation. In addition, there is an increasing focus on the patient 
reported health status as the most relevant outcome of medical 



interventions, and as clinicians we should embrace this method of 
assessing outcomes. 26 

The EQ-5D QoL questionnaire has previously been studied in AF, 
although not in association with the EHRA score. Berg et ol. 27 
reported the findings from the EQ-5D in the Euro Heart Survey. 
The EQ-5D was completed by 5050 patients attending specialist hos- 
pital departments in 35 European countries. The population studied 
was somewhat olderthan ours with a mean age of 66 years. The mean 
utility was 0.75, which would suggest that most patients had a 
symptom level around mEHRA Class 2b. A repeat survey after 1 
year completed by 3045 patients showed only a minor (0.013) im- 
provement in the health utility. A lower health utility was associated 
with AF-specific symptoms, but also other variables including increas- 
ing age, history of stroke, and the inability to take regular exercise. Only 
2.5% of the patients enroled into the Euro Heart Survey received treat- 
ment with catheter ablation. 28 A sub-analysis of the Birmingham Atrial 
Fibrillation Treatment of the Aged (BAFTA) study, looking at an older 
population (mean age 82, range 75-99) showed a comparable base- 
line utility for males (0.77) but a lower utility for females (0.68). 24 

In assessing interventions, doctors often focus on observable 
events, such as the frequency and the duration of AF episodes, 
which may not tally with the patients' perspective of their QoL. 7 ' 29 
The motivation of a patient to seek treatment, however, is based 
on the hope that symptoms will improve. We propose that the re- 
classification of EHRA Class 2 into Classes 2a and 2b can be of clinical 
use in selected patients with moderate AF-related symptoms. The re- 
classification of EHRA Class 2 is focused on the impact that AF has on 
the patient, by stating either that AF symptoms are 'not troublesome 
to patient' (Class 2a) or that the 'patient is troubled by symptoms' 
(Class 2b). This distinction will be intuitive for many clinicians, and 
appears as easily applicable as the original score. From our results, 
we can see that this clearly differentiates the two groups with not 
only a statistically significant difference in the health utility, but with 
an absolute difference of 0.09, a clinically meaningful difference. 

Limitations 

The EHRA score, and by extension the mEHRA score, is intended to 
be used for patients with either paroxysmal or non-paroxysmal AF 
and we did not differentiate between the two types in our analyses. 
It is conceivable that, within an individual mEHRA class, there is a dif- 
ference in the QoL between the two AF types. The subdivision of 
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EH RA Class 2 in Phase 2 was performed retrospectively by using clin- 
ical letters and could be considered subjective as it was based on a 
judgement of whether the patient was 'troubled by their symptoms'. 
This was however performed by two independent physicians, who 
were both blinded to the QoL scores of the patients in Phase 1 of 
the study. In a small minority of cases (n = 9), subdivision was not 
possible because of inadequate details in the clinical letter and there- 
fore these patients could not be included in Phase 2. By looking at and 
comparing a prospective cohort, we have shown that retrospective 
scoring is unlikely to have a significant effect on the conclusions 
drawn. In addition, although our assessments were made by a range 
of healthcare providers, all were experienced in the management 
of patients with AFand the population studied is from a specialist ter- 
tiary centre where a large proportion of patients are managed inva- 
sively. The classification may be less appropriately used in the 
hands of others groups of physicians or nurses or in other patient 
populations. Within these limitations, the mEHRA appears a practic- 
able and useful addition to the EHRA score. 

Conclusions 

The EHRA classification, as originally proposed, is a valid means of 
quantifying AF symptom severity and correlates well with AFEQT 
and with the generic EQ-5D. Subdividing EHRA Class 2 into 
Classes 2a and 2b by a single additional question has the potential 
to discriminate two clearly separate groups, with Class 2b patients 
having significantly impaired QoL due to AF, whereas those with 
Class 2a symptoms do not differ significantly from asymptomatic 
patients. This simple modification may further improve the clinical 
usefulness of the EHRA score, particularly when considering inter- 
ventions such as ablation, where Class 2b symptoms appear to be 
the appropriate treatment threshold. 
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